Taming Near Repeat Calculation for Crime Analysis via Cohesive Subgraph Computing

نویسندگان

  • Zhaoming Yin
  • Xuan Shi
چکیده

Near repeat (NR) is a well known phenomenon in crime analysis assuming that crime events exhibit correlations within a given time and space frame. Traditional NR calculation generates 2 event pairs if 2 events happened within a given space and time limit. When the number of events is large, however, NR calculation is time consuming and how these pairs are organized are not yet explored. In this paper, we designed a new approach to calculate clusters of NR events efficiently. To begin with, R-tree is utilized to index crime events, a single event is represented by a vertex whereas edges are constructed by range querying the vertex in R-tree, and a graph is formed. Cohesive subgraph approaches are applied to identify the event chains. k-clique, k-truss, kcore plus DBSCAN algorithms are implemented in sequence with respect to their varied range of ability to find cohesive subgraphs. Real world crime data in Chicago, New York and Washington DC are utilized to conduct experiments. The experiment confirmed that near repeat is a solid effect in real big crime data by conducting Mapreduce empowered knox tests. The performance of 4 different algorithms are validated, while the quality of the algorithms are gauged by the distribution of number of cohesive subgraphs and their clustering coefficients. The proposed framework is the first to process the real crime data of million record scale, and is the first to detect NR events with size of more than 2. Keywords-Near Repeat, Graph Analysis

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-Homogeneous Diffusion of Residential Crime in Urban China

The relationship between crime and urban environment has always been the focus of crime geography. Like diseases which can transmit and diffuse, crimes may also spread during a certain period of time and to a certain area by the near-repeat effect. Traditional near-repeat analysis focuses on the spatial spread of crimes to adjacent areas, with little regard to the displacement effect. Crime dis...

متن کامل

Analysis of Burglary Hot Spots and Near-Repeat Victimization in a Large Chinese City

A hot spot refers to numerous crime incidents clustered in a limited space-time range. The near-repeat phenomenon suggests that every victimization might form a contagion-like pattern nearby in terms of both space and time. In this article, the near-repeat phenomenon is used to analyze the risk levels around hot spots. Utilizing a recent burglary dataset in N (a large city located in southeaste...

متن کامل

Evaluating the Impact the Weekday Has on Near-Repeat Victimization: A Spatio-Temporal Analysis of Street Robberies in the City of Vienna, Austria

The near-repeat phenomenon refers to the increased risk of repeat victimization not only at the same location but at nearby locations up to a certain distance and for a certain time period. In recent research, near-repeat victimization has been repeatedly confirmed for different crime types such as burglaries or shootings. In this article the near-repeat phenomenon is analyzed for each day of t...

متن کامل

Investigating the Repeat and Near-Repeat Patterns in Sub- categories of burglary crime

The investigation of repeat and near-repeat (RNR) patterns of sub-categories of burglary crimes is of great importance to law enforcement since a distinct intervention strategy may be suitable in the application to a different sub-category. In this study, the Knox test is used to investigate the RNR patterns within the data set of three different subcategories of burglary crime in relation to t...

متن کامل

Large Scale Cohesive Subgraphs Discovery for Social Network Visual Analysis

Graphs are widely used in large scale social network analysis nowadays. Not only analysts need to focus on cohesive subgraphs to study patterns among social actors, but also normal users are interested in discovering what happening in their neighborhood. However, effectively storing large scale social network and efficiently identifying cohesive subgraphs is challenging. In this work we introdu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1705.07746  شماره 

صفحات  -

تاریخ انتشار 2017